
Juan/sc 10379/list return types in the list tests output #383

Merged
johnwalz97 merged 13 commits into main from juan/sc-10379/list-return-types-in-the-list_tests-output on Jul 17, 2025

Conversation

@johnwalz97 (Contributor) commented on Jun 20, 2025:

Pull Request Description

What and why?

This PR adds two columns to the list_tests output, specifically 'has tables' and 'has figures'. These columns let users know what to expect from a test's output before they run it.

How to test

To test, open a notebook and run this snippet of code:

from validmind.tests.load import list_tests

# Show the styled table of tests, including the new output columns
list_tests()

# To work with the raw (unstyled) results instead, pass pretty=False, e.g.:
# len(list_tests(pretty=False))

What needs special review?

Please let me know what you think about the way the information is presented, specifically via flags. We could also combine the output information into two columns.

Dependencies, breaking changes, and deployment notes

There should be no breaking or logical changes besides the additional columns since the majority of changes are just return type annotations.

Release notes

Added columns to the list_tests() output to show what kind of artifacts each test in the ValidMind library produces.

Checklist

  • What and why
  • Screenshots or videos (Frontend)
  • How to test
  • What needs special review
  • Dependencies, breaking changes, and deployment notes
  • Labels applied
  • PR linked to Shortcut
  • Unit tests added (Backend)
  • Tested locally
  • Documentation updated (if required)
  • Environment variable additions/changes documented (if required)

@johnwalz97 added the enhancement (New feature or request), highlight (Feature to be curated in the release notes), and 🚧 Work In Progress labels, and removed the highlight label on Jun 20, 2025
- Added support for inspecting return types to determine if they include figures or tables.
- Introduced new type hints for Matplotlib and Plotly figures.
- Updated the test listing function to include flags for presence of figures and tables in the output.
@juanmleng (Contributor) commented:

This looks very nice! I think outputting two flags makes the result clear and easy to interpret. However, I have a couple of comments on that:

  • Currently, when a test does not include a return type annotation, we default both flags to False. However, this could be confusing in edge cases where the test only outputs text. To make this more transparent, we could consider setting the output to "unknown" in such cases and issuing a warning to inform users that the test lacks a return type.
  • Related to the above, when a test returns a string, we don’t explicitly indicate that. While this is a bit of an edge case, it is still a valid scenario. One possible approach is to treat tests with no return type annotation as "unknown" and leave both flags as False. This would implicitly suggest that the test returns a string, although it's not very transparent.
  • Adding a third flag such as has_text might be overkill for this rare case. On the other hand, doing so would also document the "art of the possible" by acknowledging that tests can return text. That said, this is likely not common enough to strongly influence the design.
  • It would perhaps also be quite useful to include dedicated filters to list tests by has_figure and has_table.

Example:
[Screenshot attached: Screenshot 2025-06-25 at 13 01 49]

@johnwalz97 (Contributor, Author) commented:

> This looks very nice! I think outputting two flags makes the result clear and easy to interpret. However, I have a couple of comments on that: […]

Hmm, good points.

So for any custom test that doesn't have return type annotations, we won't be able to tell what the outputs are. I can clarify the warning message to make it clearer what users should do, and I can set the column to "Unknown" like you mentioned.

As far as adding a Has Text column, I think that might be overkill for now but definitely something we can keep on the back burner.
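
For context, a rough sketch of how annotation-based inspection with an "Unknown" fallback could look. The helper name and type checks below are illustrative assumptions, not necessarily what this PR implements:

import typing
from typing import get_type_hints

import matplotlib.figure
import pandas as pd
import plotly.graph_objects as go

FIGURE_TYPES = (matplotlib.figure.Figure, go.Figure)


def inspect_test_outputs(test_func):
    """Return (has_figures, has_tables), or ("Unknown", "Unknown") when
    the test function has no return type annotation."""
    return_type = get_type_hints(test_func).get("return")
    if return_type is None:
        # No annotation: we cannot tell what the test outputs
        return "Unknown", "Unknown"

    # Flatten e.g. Tuple[pd.DataFrame, go.Figure] into its component types
    components = typing.get_args(return_type) or (return_type,)
    has_figures = any(
        isinstance(t, type) and issubclass(t, FIGURE_TYPES) for t in components
    )
    has_tables = any(
        isinstance(t, type) and issubclass(t, pd.DataFrame) for t in components
    )
    return has_figures, has_tables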

@AnilSorathiya (Contributor) commented:

@johnwalz97 @juanmleng and @cachafla
Just wondering: how about displaying the return types of tests in a single column, where the output types are listed as values in list form, similar to "required inputs" -> ["dataset", "model", "models", ...]?
For example, an "Output types" column in the table:
[table, figure, text, ...]

It would allow us to add more return types without disturbing list_tests().

@johnwalz97 (Contributor, Author) commented:

> Just wondering: how about displaying the return types of tests in a single column, where the output types are listed as values in list form? […]

That's not a bad idea. The only thing is that it makes it a bit more difficult to filter the DataFrame down to only tests that have tables, or only tests that have figures, but it would definitely make the table a bit more compact.

The thing is, right now it's only set up to tell whether a test outputs a table or a figure. It shouldn't be that difficult to add detection for the other types; it's just a question of whether we want it. Let me know what you guys think.
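
As a rough illustration of the single-column idea, the two flags could be collapsed into an "Output types" list along these lines (the helper below is hypothetical, just to show the shape of the data):

def output_types(has_figures, has_tables, has_text=False):
    """Collapse individual flags into a single list-valued column,
    similar in spirit to the "Required Inputs" column."""
    types = []
    if has_tables:
        types.append("table")
    if has_figures:
        types.append("figure")
    if has_text:
        types.append("text")
    return types


# A test that returns both a table and a figure:
print(output_types(has_figures=True, has_tables=True))  # ['table', 'figure']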

@cachafla (Contributor) commented:

@johnwalz97 how does the user filter the list of tests for "show me tests that output figures"?

@johnwalz97 (Contributor, Author) commented:

> @johnwalz97 how does the user filter the list of tests for "show me tests that output figures"?

Right now, they would have to programmatically filter the list that's returned if they don't use the pretty argument, or they would have to get the DataFrame from the Styler object and filter that. There isn't a parameter for filtering right now; do you want to add that as part of this ticket?
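
For example, filtering could currently be done along these lines. The column names "Has Figures" and "Has Tables" are assumptions based on this PR's description, so check the actual DataFrame columns:

from validmind.tests.load import list_tests

# list_tests() returns a styled table (a pandas Styler) by default;
# the underlying DataFrame is available through its .data attribute
styler = list_tests()
df = styler.data

# Keep only the tests flagged as producing figures
figure_tests = df[df["Has Figures"] == True]
figure_tests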

@github-actions (Contributor) commented:

PR Summary

This pull request introduces extensive updates across the codebase by adding explicit return type annotations and updating function signatures with appropriate type hints. The changes span multiple modules and directories including tests, ongoing monitoring, prompt validation, and unit metrics for both classification and regression.

Key enhancements include:

  • Insertion of return type annotations for functions that previously lacked them, ensuring clearer contracts between functions and improving static analysis.
  • Addition of necessary imports from the typing module (e.g. Tuple, List, Dict, Optional) and other related modules to support the type annotations.
  • Overall alignment of function signatures across various test modules (model validation, ongoing monitoring, prompt validation) and unit metrics, which helps in improving code readability and maintainability.
  • Consistent update of functions' return types such as using Tuple for multiple return values and float for scalar metrics.

These changes do not alter the existing logic or functionality. Instead, they serve to improve the code quality, ease future modifications, and facilitate better static type checking and code analysis.
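
For illustration, the added annotations are of this general form (the function below is a generic example, not a specific test from the repository):

from typing import Dict, List, Tuple

import plotly.graph_objects as go


# Before: the test function had no return annotation.
# After: the explicit return type documents that this test produces
# both a table (a list of row dicts) and a Plotly figure.
def my_example_test(dataset) -> Tuple[List[Dict[str, float]], go.Figure]:
    ...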

Test Suggestions

  • Run the full suite of unit tests and integration tests to verify that no functionality has been altered due to the introduction of type annotations.
  • Use a static type checker (e.g., mypy) to ensure all type hints are correctly applied and there are no inconsistencies.
  • Manually review key functions in test modules for runtime behavior, particularly where complex return types (e.g., tuples with multiple types) are used.
  • Execute performance tests to confirm that the changes in annotations have no adverse effects on runtime performance.

@cachafla (Contributor) left a comment:

I updated notebooks/how_to/explore_tests.ipynb. I suggest we merge this and address improvements later (like filtering or adding something to describe_test) since this PR makes changes to every test.

@johnwalz97 merged commit fe722e9 into main on Jul 17, 2025
7 checks passed
@johnwalz97 deleted the juan/sc-10379/list-return-types-in-the-list_tests-output branch on July 17, 2025 at 22:57

Labels

enhancement New feature or request

4 participants